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IMAGE  UNDERSTANDING  AND  INFORMATION  EXTRACTION 
RESEARCH  OVERVIEW  AND  SUMMARY 

Since  October  1,  1973,  we  have  been  supported  by  ARPA  IPT  office  to  carry 
out  research  In  Image  analysis  and  modeling.  More  recently,  the  emphasis  of 
our  research  has  been  shifted  toward  achieving  a better  understanding  of  Image 
structure  and  using  this  knowledge  to  develop  techniques  for  extracting  In- 
formation from  Images.  To  reflect  this  new  emphasis,  we  have  changed  the  title 
of  our  program  to:  Image  Understanding  And  Information  Extraction. 

The  ultimate  goal  In  automatic  Image  understanding  and  Information  ex- 
traction Is  to  build  a machine  which  will  look  at  a scene  and  then  give  us  a 
qualitative  as  well  as  quantitative  description  of  It.  This  formidable  task 

can  be  broken  down  Into  several  sub-tasks  as  Indicated  by  the  block  diagram  In 
Fig.  1. 

The  sensor  looks  at  the  scene  and  outputs  Image  data  which  are  generally 
mult  I spectral  and  multitemporal.  The  preprocessing  box  attempts  either  co  put 
the  Image  Into  a form  which  Is  more  suitable  for  analysis  by  the  latter  boxes 
or  to  compress  the  Image  data  for  transmission  to  a remote  location  or  storage 
for  future  use.  The  Image  segmentation  bo..  In  simple  Instances,  locates  ob- 
jects In  the  Image  (for  example,  In  character  recognition,  It  locates  the 
characters).  For  more  complex  scenes,  It  segments  the  Image  Into  regions. 

Each  of  these  regions  Is  classified  by  the  classification  box.  The  classifi- 
cation can  be  done  either  by  classical  decision-theoretic  methods  (feature  ex- 
traction followed  by  statistical  classification)  or  by  the  more  recent  syn- 
tactical methods.  In  linguistic  terminology,  the  regions  are  primitives,  and 
the  classifier  finds  attributes  for  these  primitives.  Finally,  the  structural 
analysis  box  attempts  to  find  the  relationship  (spatial,  temporal,  spectral) 
among  the  regions  (primitives).  In  some  cases,  one  may  want  to  develop  a 
grammar  for  the  primitives. 
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The  functions  of  the  boxes  In  Fig.  1 overlap,  and  their  demarcation  Is 
not  clear  cut.  <’  thermore,  there  Is  strong  feedback  In  the  system  (as  In- 
dicated by  the  Oa^.ed  arrcv/s  In  the  block  diagram).  For  example,  structural 
analysis  may  reveal  that  certain  regions  are  classified  wrongly  and  thus  force 
a reclassification.  Similarly,  difficulties  encountered  in  classification  may 
make  It  necessary  to  reexamine  image  segmentation.  Etc.  We  aim  to  study  not 
only  the  individual  boxes  in  Fig.  1,  but  also  their  interaction. 

We  are  carrying  out  both  basic  and  applied  research  with  the  hope  that  on 
the  one  hand  the  results  of  our  basic  research  could  find  practical  applica- 
tions, and  on  the  other  hand  problems  arise  from  practical  situations  would 
guide  the  direction  of  our  basic  research.  Our  emphasis  in  both  basic  and 
applied  research  is  on  the  syntactic  decomposition  and  recognition  of  imagery. 
The  data  base  we  use  includes  but  Is  not  limited  to  aircraft  and  satellite 
multispectral  Imagery. 

Our  research  projects  fail  into  five  overlapping  categories. 

A.  IMAGE  PRIMITIVES;  Segmenting  images  Into  regions  (primitives) 

We  investigate  two  approaches  to  image  segmentation:  edge  de- 
tection, and  region  growing. 

(1)  Edge  Detection 

Many  edge  detection  techniques  have  been  studied.  However,  none  of 
them  works  well  for  noisy  and  unsharp  images.  There  are  two  difficul- 
ties. First,  noise  gives  rise  to  false  edge  points.  Second,  noise  and 
Image  blurring  make  it  very  hard  to  connect  edge  points  to  form  con- 
tinuous curves. 

The  main  thrust  of  our  research  is  to  use  syntactic  methods  to  help 
extract  connected  edge  curves  in  a noisy  and  unsharp  environment.  We 
propose  to  employ  a "two-level11  scheme  to  detect  edges  where  there  is 
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a-prlor!  knowledge  of  some  structure  In  a pattern.  The  basic  Idea  Is  to 
separate  the  problem  of  detecting  the  edges  In  noise  from  that  of  de- 
tecting the  edge  structure,  but  to  Improve  the  efficiency  of  each  solu- 
tion by  transferring  Information  from  one  to  another.  To  locate  noisy 
or  "smeared"  edges,  a parser  for  a stochastic  string  generating  grammar 
will  be  used,  but  Its  operation  will  be  controlled  by  a web  grammar 
parser  that  recognizes  the  edge  structure.  This  assumes  a parallel  pro- 
cessing capability  so  that  the  two  levels  may  proceed  simultaneously  and 
guide  each  other.  The  use  of  stochastic  web  grammars  would  not  be  In- 
tended to  handle  "local"  noise  and  smearing,  but  rather  any  such  phe- 
nomena which  have  a structural  character  (e.g.,  edges  obscured  by 
shadows) . 

In  order  to  use  syntactic  methods  to  help  edge  detection  In  a 
digital  environment,  It  Is  necessary  to  understand  the  nature  of  digi- 
tized edges.  We  are  theiefore  studying  the  characterization  of  digital 
edges,  especially  straight  edges  and  lines.  Some  properties  of  Ideal 
digital  straight  lines  and  a method  of  detecting  them  are  discussed  In 
the  report  by  Tang  and  Huang.  Currently,  we  are  experimenting  with  real 

digital  edges,  l.e.,  edges  obtained  by  digitizing  real-life  Images  which 
contain  blurring  and  noise. 

Another  Important  problem  we  are  studying  that  Is  closely  related 
to  edge  detection  Is  mensuration.  In  many  reconnaissance  applications, 
one  Is  often  Interested  In  measuring  the  distance  between  two  edges. 

Here,  It  Is  of  the  utmost  Importance  to  dekerm!ne  edge  locations  ac- 
curately. We  have  developed  an  efficient  algorithm  for  finding  the 
maximum  a posteriori  estimates  of  edge  locations  which  works  very  well 
In  our  preliminary  tests  (see  the  report  by  Burnett  and  Huang).  Cur- 
rently we  are  testing  the  algorithm  on  aerial  photos. 
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(II)  Region  Growing  Techniques 

The  BLOB  algorithm  of  Wlntz  and  Gupta  works  quite  well.  However, 
because  of  Its  sequential  nature,  the  resulting  boundaries  depend  on  the 
order  In  which  the  picture  elements  are  processed.  For  example.  If  we 
process  the  Image  In  the  conventional  line  scanning  order,  then  we  tend 
to  get  many  more  vertical  than  horizontal  boundaries. 

To  remedy  this,  we  propose  to  develop  parallel  segmentation  tech- 
nlques.  One  approach  Is  to  first  divide  the  Image  Into  say  2x2  picture 
element  regions,  then  we  merge  together  regions  with  similar  character- 
istics (e.g.,  spectral  contents,  mean  and  variance  of  the  gray  levels, 
or  more  generally,  texture)  In  a parallel  manner.  All  pairs  of  neighbor- 
ing regions  are  examined  simultaneously,  and  the  boundary  between  a pair 
Is  removed  If  the  two  regions  are  similar.  This  process  Is  repeated 
until  all  neighboring  regions  are  dissimilar. 

A more  efficient  Initialization  (l.e.,  to  create  starting  regions 
for  the  parallel  region  growing  algorithms)  can  be  achieved  by  using  un- 
supervtsed  clustering  techniques.  Each  picture  element  Is  represented 
by  a point  In  the  feature  space.  The  features  could  be,  for  example, 
apectral  contents  of  the  picture  element,  means  and  variance  of  the 
spectral  contents  (or  more  generally  texture)  of  a small  neighborhood 
around  the  picture  element  In  question.  The  feature  space  Is  then  par- 
titioned by  some  clustering  algorithm.  The  picture  elements  In  each 
cluster  are  considered  as  In  the  same  region  In  the  original  Image.  The 

region  Is  generally  disconnected  - It  contains  several  connected  sub- 
regions. 

The  main  thrust  of  our  current  research  In  ,mage  segmentation  Is  to 
develop  efficient  algorithms  to  do  the  parallel  segmentation  Including 


initialization,  and  to  find  (for  various  classes  of  images)  suitable 
measures  of  similarity  between  the  regions.  Some  preliminary  results  of 
unsupervised  clustering  using  various  feature  pairs  are  contained  in  the 
report  by  Yoo  and  Huang. 

After  we  have  gained  experience  in  parallel  merging  and  similarity 
measures,  we  shall  move  on  to  the  development  of  hierarchical  segmenta- 
tion techniques  and  the  interaction  between  segmentation  and  structural 

scene  analysis. 

IMAGE  ATTRIBUTES:  Characterizing  the  primitives 

(i)  Texture  Analysis 

The  success  of  the  region-growing  approach  to  image  segmentation 
depends  critically  on  the  suitable  choice  of  a similarity  measure  be- 
tween regions  which  in  turn  is  largely  a problem  of  texture  analysis. 

We  plan  a major  research  effort  in  the  understanding  of  image  textures 
and  the  use  of  textures  in  image  information  extraction. 

One  major  problem  of  texture  analysis  is  that  there  is  no  common 
definition  of  image  texture.  Our  definition  will  be  an  iterative  de- 
velopment starting  with  sample  textures  from  a book  by  Brodatz,  photo- 
graphs of  natural  scenes,  aircraft  data,  and  LANDSAT  imagery.  As  we 
develop  texture  descriptors  and  classification  algorithms,  our  texture 
definitions  will  be  modified  to  incorporate  the  new  findings. 

The  main  thrust  of  our  current  research  is  to  find  texture  de- 
scriptors for  pattern  classification  which  are  invariant  to  illumination 
and  resolution  changes,  and  are  computationally  simple;  and  to  use  these 
texture  descriptors  to  find  texture  boundaries  in  images. 

We  have  developed  a new  class  of  texture  descriptors,  the  max-min 
descriptors,  which  are  computationally  very  simple  and  which  work 
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extremely  well  In  our  preliminary  classification  and  boundary  finding 
experiments.  Some  results  of  texture  boundary  finding  are  shown  In  the 
report  by  Mitchell. 

Our  study  on  texture  is  closely  related  to  our  study  on  image  seg- 
mentation, since  similarity  measures  among  regions  will  no  doubt  include 
texture  descriptors.  Texture  will  probably  be  relied  on  heavily  !n 
hierarchical  image  segmentation. 

it  Is  evident  that  texture  can  be  defined  on  several  levels;  e.g., 
the  detailed  texture  of  a small  rock  and  the  overall  texture  of  hundreds 
of  rocks.  Complex  images  may  be  described  in  terms  of  several  levels  of 
textures.  By  adjusting  the  parameters  In  an  image  segmentation  algo- 
rithm, we  can  first  divide  the  iamge  Into  regions  according  to  micro- 
texture.  Then  we  merge  some  of  these  regions  together  according  to  the 
next  higher-level  texture,  etc.  This  approach  Is  particularly  suitable 
for  a syntactic  description  of  the  image.  Depending  on  the  particular 
application  at  hand,  we  will  consider  the  regions  at  a certain  tfvture 
level  as  the  primitives.  Then  a grammar  will  be  derived  to  describe 
how  these  primitives  combine  to  form  higher-level  regions. 

(ii)  Shape  Analysis 

We  are  attacking  the  shape  analysis  problem  both  from  the  classical 
decision-theoretic  point  of  view  and  the  more  recent  syntactic  approach. 
In  the  classical  framework,  the  first  step  in  pattern  recognition  is 
feature  extraction.  And  one  important  class  of  features  in  picture  pro- 
cessing and  recognition  is  that  of  shape  descriptors.  Recent  work  by 
Fu  and  Persoon  indicates  that  Fourier  boundary  descriptors  are  quite 
effective  for  classifying  hand-printed  numerals  and  some  medical  images. 
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In  terms  of  optimal  curve  matching  In  shape  analysts,  a distance 
measure  has  been  proposed.  The  distance  measure  Is  expressed  as  the 
Euclidean  metric  between  the  two  sets  of  Fourier  descriptors  represent” 
ing  two  different  curves.  The  minimization  of  the  distance  measure  re- 
duces to  the  solution  of  roots  of  a periodic  function  with  a finite 
number  of  harmonics.  This  is  also  one  of  the  advantages  of  using 
Fourier  descriptors  instead  of  performing  the  minimization  of  a distance 
measure  defined  directly  in  the  spatial  domain. 

We  are  refining  the  Fourier  descriptors  and  applying  them  to  the 
shape  analysis  of  military-related  patterns.  Specifically,  we  are  using 
the  Fourior  boundary  descriptors  for  airplane  shape  recognition.  The 
results  are  very  encouraging  (see  the  report  by  Wallace  and  Wlntz). 
c*  IMAGE  STRUCTURE:  Syntactic  scene  analysts  - Determining  the  Relation- 

ships Among  Primitives 

Much  of  the  work  to  date  with  respect  to  the  computer  analysis  of 
image  data  (and  especially  mul t ispectral  image  data)  from  high-flying 
aircraft  and  earth-orbiting  satellites  has  concentrated  on  the  spectral 
domain;  l.e.,  the  characterization  of  each  point  in  the  image  (on  the 
ground)  In  terms  of  its  "color".  Little  attention  has  been  paid  to  the 
information  contained  in  the  spatial  relationships  in  the  data,  even 
though  past  experience  with  photo-interpretation  indicates  that  this 
information  component  is  substantial.  We  are  studying  the  character- 
ization of  the  spatial  Information  content  of  such  imagery  in  two  ways: 
(1)  syntactic  Image  models,  and  (2)  Markov  dependency  models  of  context. 

Although  the  syntactic  or  "linguistic"  approach  to  image  des- 
cription and  analysis  has  been  widely  cited  as  holding  great  promise, 
little  practical  work  of  this  nature  has  yet  appeared.  However,  Fu  and 


Brayer  have  recently  made  progress  In  this  direction,  developing  a 
grammar  for  the  description  of  roads  and  for  the  detection  of  clouds  and 
cloud  shadows  In  satellite  Imagery.  An  extension  of  the  work  of  Fu  and 
Brayer  Is  contained  In  the  report  by  Fu,  et  al. 

The  power  of  the  syntactic  approach  In  accounting  for  pattern 
structure  (both  spatial  and,  potentially,  spectral  dependencies)  makes 
It  a natural  avenue  to  pursue  for  the  description  and  "understanding" 
of  such  Imagery.  Specific  applications  Include  the  detection  and  classi- 
fication of  man-made  objects;  detection  of  geological  structures  In- 
dicative of  ore  deposits  and  suitable  construction  sites;  and  land-use 
and  natural  resource  Inventory. 

Classifiers  which  operate  In  the  spectral  domain  generally  classify 
a single  Image  point  at  a time.  Utilization  of  context  - the  informa- 
tion contained  in  local  spatial  dependencies  among  Image  points  - has 
been  shown  to  Improve  significantly  the  accuracy  with  which  classifi- 
cation can  be  performed  and  images  described.  One  way  to  charar^rlze 
such  dependencies  Is  In  terms  of  Markov  processes,  l.e.,  statistical 
transition  properties.  We  conjecture  that  characterization  of  local 
scene  dependencies  In  this  manner  could  provide  useful  "picture  primi- 
tives" for  input  to  syntactic  models  such  as  we  have  discussed  above. 

The  objective  of  this  aspect  of  the  research  is  to  determine  the 
utility  of  two-dimensional  (and  possibly  higher-dimensional ) stochastic 
dependencies  for  characterizing  spatial  relationships  and  Improving 
classification  In  Image  data.  These  methods  will  be  applied  to  Image 
data  In  two  ways:  as  part  of  the  classification  operation  per  se,  and 

In  a post  classification  mode  where  the  Input  has  been  "pre-classif led" 
using  Its  mul tlspectral  characteristics.  The  Impact  of  parallel 
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processing  on  the  efficiency  of  this  type  of  enelysls  will  be  assessed. 

Ho re  details  on  the  use  of  context  are  contained  In  the  report  by 

Fu.  et  ale 

0.  IMAGE  RECOGNITION  TECHNIQUES. 

Many  pattern  recognition  problems  nmy  be  viewed  as  combinatorial 

optimization  problems.  Exhaust, nation  can  determine  the  optimum 

solution,  but  the  required  computation  Is  prohibitive.  One  of  the  prom 
Islng  approaches  to  overc^  this  difficulty  Is  the  branch  and  bound 
TOthod.  In  exhaustive  search,  we  search  ...  branches  of  a tree  Into  the 
bottom  and  evaluate  a criterion  via.  for  each  path.  In  the  branch  and 
bound  method  each  node  of  the  tree  provides  Informat.on  as  to  whether  or 
not  the  search  goes  d«n  further.  If  the  answer  Is  no.  the  search 
bounces  back  ft.  the  node  without  trying  the  branches  under  the  node. 

Thus,  the  number  of  computed  paths  Is  expected  to  be  significantly  smaller. 

Our  objective  Is  to  apply  this  technique  effectively  to  pattern 
recognition.  Hany  problems  In  pattern  recognition  can  be  reformulated  In 
terms  of  the  branch  and  bound  .M.  In  order  to  meet  our  objective,  we 
must  find,  for  each  problem,  a proper  criterion  which  evaluates  the  value 
of  each  branch  and  also  can  provide  the  needed  Information  at  each  node. 
The  problems  we  are  considering  Include:  feature  selection,  clustering. 

and  classifier  design. 

E.  applications 

(,)  Locating  Urge  Concrete  Airports  In  LANOSAT  Imagery 

We  first  use  spectral  Information  to  locate  concrete  areas  In  the 

lmage.  Then  we  find  straight  line  segments  in  these  areas.  Finally. 

syntactic  Information  will  be  used  to  separate  airports  from  highways. 
Some  results  will  be  presented  In  a future  report. 
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(II)  Missile  Tracking 

The  main  thrust  Is  to  develop  real-time  techniques  for  locating  and 
tracking  a missile  In  TV  Images.  Edge  detection  and  change  detection 
techniques  will  be  used.  Because  of  the  real-time  requirement  the  pro- 
cessing time  is  very  limited. 

Surface  acoustical  wave  scanners,  being  developed  at  Purdue,  seem  to 
be  suitable  for  this  application.  A preliminary  Imaging  experiment  using 
a SAW  scanner  Is  described  In  the  report  by  Gunshor  and  Huang. 

(III)  Reducing  Clouds  and  Haze  In  LANDSAT  Imagery 

By  modeling  clouds  and  haze  as  a multiplicative  process,  we  attempt 
to  reduce  their  obscuring  effects  by  three-dimensional  (2  spatial  dimen- 
sions, 1 spectral  dimension)  digital  filtering.  Some  initial  computer 
simulation  results  are  shown  In  the  report  by  Mitchell  and  Chen.  Cur- 


rently, the  technique  Is  being  applied  to  LANDSAT  Imagery. 
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IMAGE  DECOMPOSITION 
J.  W.  Burnett  and  T.  S.  Huang 

In  previous  work  [1]  we  have  Investigated  some  of  the  algorithms  for 
image  noise  reduction  available  If  we  model  an  Image  as  a finite  state  Markov 
process.  This  model  has  led  to  non-linear  processing  methods  that  are  supe- 
rior to  current  linear  methods  in  images  with  sharp  edges  [1], 

Currently  we  are  studying  the  application  of  the  nodel  to  the  problem  of 
Image  segmentation. 

Segmentation  algorithms  are  valuable  in  pattern  recognition  [33  and  data 
compression  [2]  and  potentially  valuable  in  making  measurements  of  objects 
characterized  by  gray  level  boundaries.  Unfortunately  the  problem  of  image 
segmentation  is  not  as  simple  as  the  application  of  a derivative  operator  to 
find  boundaries.  Edges  are  blurred  by  camera  motion,  lenses,  atmospheric 
turbulence,  sampling,  etc.  and  there  is  always  noise  due  to  film  grain, 
quantization,  transmission,  and  so  forth.  Thus  any  segmentation  algorithm 
must  be  capable  of  dealing  with  Image  blur  as  well  as  noise. 

In  this  report  we  examine  an  algorithm  based  on  our  finite  state  model 
that  produces  maximum  liklihood  sequence  estimates  of  the  boundary  component 
of  digitized  Images.  This  algorithm  has  tie  capability  to  deal  with  image 
blur  and  noise  with  great  speed.  While  the  technique  has  no  immediate  exten- 
sion to  two  simultaneous  dimensions  we  will  suggest  a way  that  a two-dimen- 
sional problem  can  be  handled  one-dimension  at  a time. 

The  Problem 

Suppose  that  we  model  an  Image  scan  line  ]_  as  a finite  state  Markov  pro- 
cess as  shown  in  Fig.  1.  This  model  assumes  that  the  gray  level  across  a scan 
line  is  piecewise  constant  and  takes  on  a finite  number  of  possible  gray 
level  values.  Actual  scan  lines  do  not  look  like  Fig.  1,  however,  so  we  model 


J"j-— 
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the  available  scan  I l„e  Z as  bel„9  the  Idea,  line  (passed  through  the  system 
Of  Fig.  2. 

reasonable  way  to  extract  the  boundary  component  of  the  available  line 
z Is  to  calculate  the  maximum  a posteriori  estimate  f of  I.  In  other  words 
we  want  to  find  an  estimate  ? that  maximizes  P(l|z). 

A well  known  solution  to  the  problem  Is  due  to  Vlterbl  [A]  (see  also  [5] 
and  [6]).  The  Vlterbl  algorithm  not  only  provides  a map  estimate  of  the 

boundary  component  but  requires  computation  time  on  the  order  of  N where  N Is 
the  number  of  points  to  be  processed. 

However,  before  the  Vlterbl  algorithm  can  be  used  some  modifications  have 
to  be  made.  First,  the  possible  states  that  the  boundary  process  can  assume 
may  not  be  known  a priori.  this  case  the  possible  states  must  be  obtained 
from  the  available  data  Z.  There  are  a number  of  ways  that  this  can  be  done. 

The  technique  used  here  is  a clipped  local  average,  which  has  the  advan- 
tage of  being  very  fast.  There  are  many  clustering  techniques  that  might 

yield  more  accurate  estimates  but  the  increased  accuracy  would  require  more 
computation  time. 

Another  problem  that  must  be  dealt  with  Is  that  In  picture  processing 
the  blurring  function  H is  typically  a function  of  two  variables  and  a modifi- 
cation of  the  algorithm  must  be  made,  in  many  Instances  H will  be  separable 
and  the  image  can  be  processed  row  by  row  and  then  column  by  column.  If  H Is 
not  exactly  separable  it  can  be  broken  up  into  a separable  sum  [/] : 


where  the  various  terms  are  defined  In  (7).  Thus  even  though  the  degrading 
system  H is  two-dimensional  we  can  process  the  image  a line  at  a time. 
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Example 

A 256x256  letter  "G"  was  blurred  with  an  Impulse  response 


.21 

.56 

.21 

.01 

/ 


Noise  was  added  to  produce  a SNR  of  20  dB  (see  Fig.  3).  This  noisy  blurred 
image  was  processed  by  our  algorithm.  As  shown  In  Fig.  4 the  noise  was  re- 
duced and  the  smearing  eliminated.  The  computer  time  required  was  241  seconds. 
However,  this  includes  130  seconds  required  to  transpose  the  image  with  the 
only  available  routine.  A more  efficient  transposition  routine  would  greatly 
decrease  the  time  required. 


Fig.  5 shows  a scatter  plot  of  a scan  line  of  the  estimated  image  plotted 
about  the  scan  line  of  the  original  Image.  In  this  line  (and  in  fact  most 
lines)  the  boundary  locations  were  estimated  correctly.  Thus  our  algorithm 

should  be  useful  for  making  accurate  measurements  of  objects  in  digitized 
images. 

Cone I us  Ions 


The  algorithm  promises  to  provide  a fast  and  accurate  method  of  segment- 
ing a noisy  and  blurred  image  Into  regions  of  constant  gray  level. 

Some  additional  work  will  be  necessary  before  the  promise  is  realized. 

A careful  examination  of  Fig.  4 shows  that  a few  errors  in  estlmat  on  were 
made.  Most  of  these  are  due  to  a poor  determination  of  the  true  picture  levels 
by  our  local  averaging  techniques  (see  Fig.  5 for  example).  Thus  our  next 
task  is  to  find  better  ways  of  determining  the  possible  states  from  the 
blurred  and  noisy  data. 
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i = gray  level  of  Ideal  scan  line  at  position  K 
K 

z = gray  level  of  actual  scan  line  at  position  K 
K 

Fig.  2 Model  of  Scan  Line  Degradation 


Fig,  3 The  Available  Image 


Fig.  b The  Estimate  of  the  Boundary 
Component  of  the  Image 
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Fig,  5 Actual  and  Estimated  Scan  Line 
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A STUDY  OF  DIGITAL  STRAIGHT  LINES 
G.  Y.  Tang  and  T.  S.  Huang 

his  report  consists  of  four  sections.  In  section  I we  Investigate  some 
fundamental  propert.es  of  the  digitization  of  a straight  line.  sect.on  „ 

" ,00k  'nt0  * SPeC'8'  Cla“  °f  Stra'9ht  «l»se  slopes  are  rational. 

section  III  we  develop  a scheme  to  approxl™te  any  straight  .In.  by  . rat,onal- 

slep.  straight  such  that  within  the  scope  of  the  digital  picture  they  give 
the  same  digitization.  In  sect.on  IV  we  develop  technics  to  recognize  and 
identify  the  digitization  of  a straight  line. 

Section  I 

The  digitization  scheme  used  this  paper  Is  the  standard  grld-lnt.rsec- 
tl°n  SCh'",a  f°r  Cha,n  “dlr9  - n«t  developed  by  Fre^an  [,].  G,ven 
a coordinate  grid  superimposed  on  the  straight  line,  the  grip  point  nearest  to 
the  crossing  becomes  a point  of  the  digitization  of  that  line.  (,„  the  case 
the  crossing  Is  exactly  midway  between  two  grid  points,  we  shall  assuM  for 
concreteness  that  w.  always  round  down  to  the  one  having  smaU.r  value.,  The 
digitization  points  can  then  be  connected  by  straight  line  segnmnts  called 
elements.  There  are  only  eight  possible  directions,  numbered  from  0 to  7 
counterclockwise  sense.  0 Is  corresponding  to  the  e.ement  lying  +X  direc- 
tion. Those  elements  have  only  2 lengths.  I and  2.  A sequence  of  the  ele- 
ments  Is  called  a chain. 

Some  observations  about  the  properties  of  the  digitization  of  a straight 
line  are  outlined  as  follows  [2]. 

^ I)  When  the  straight  line  to  be  digitized  has  „s  slope  strictly  between 
-‘>5  and  ks  , Its  digitization  depends  only  on  Its  crossings  of  the  vertical 
grid  lines.  A simple  and  efficient  method  to  generate  the  chain  of  a straight 
line  follows  Immediately  from  this  observation.  A line  Is  defined  by 
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Y = mx  + Y.  , Q < m < i . 

D — 

The  Ith  Freeman  Code  is  found  by  Y{  - Y|+|,  where  Y(  - [mi  + Yb  + 1/2],  [X] 
indicating  the  greatest  Integer  smaller  than  X. 

2)  Likewise  for  a straight  line  with  slope  between  and  i35°,  only 
horizontal  crossings  piay  a role  in  the  digitization  of  that  tine. 

3)  For  the  digitization  of  a straight  line,  there  are  at  mast  two  types 
of  elements  present,  and  these  can  differ  only  by  unity,  moduio  eight.  One 
exarrple  is  the  straight  line  with  slope  between  0°  and  kS° , oniy  0 and  1 can  be 
presented.  Particularly  for  slope  equal  to  7/36  its  Freeman  Code  is 

0001 0001 000 i 000 i 0001 0001 00001 0001 .... 

k)  One  of  the  two  elements  mentioned  in  (3)  always  occurs  singly,  in 
the  above  example  it  is  obvious  that  i occurs  singly,  if  we  look  into  this 
property  further,  we  wiii  see  that  for  straight  lines  lying  between  0°  and 
26.5°,  1 occurs  singly  and  for  those  lines  lying  between  26.5°  and  1*5°,  0 
occurs  singly. 

5)  Successive  occurances  of  the  element  occuring  singly,  as  mentioned  in 
(3)  arr*  as  uniformly  spaced  as  possible.  Considering  the  same  example  again, 
we  can  see  the  spaces  between  two  adjacent  I's  are 

333333^333333^333333^* 

where  3 indicates  three  zeroes  between  them  and  k indicates  four  zeroes  between 
them.  We  may  as  weii  caii  each  successive  occurance  of  0's  a run.  Then  there 
are  at  most  two  run  lengths  and  they  are  consecutive  integers.  One  of  these 
two  runs  wiii  always  occur  only  once  at  a time.  The  other  one  of  course  will 
occur  in  runs.  And  furthermore,  for  the  run  length  that  occurs  in  runs. 
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these  runs  can  themselves  have  only  two  lengths,  which  are  consecutive  Integers 
and  so  on  An  example  Is  for  a line  of  slope  14/39.  The  chain  code  Is 

001 001 001 001 01 001 001 001 001 01 001 001 001 01 ... 

In  terms  of  run  length,  we  will  have 

222212222122221... 

So,  we  can  see  the  run  of  length  1 occurs  singly  and  the  run  length  of  the 
other  run  are  two  consecutive  integers,  4 and  3.  3 occurs  singly. 

6)  Is  there  a necessary  and  sufficient  condition  that  a digital  arc  is 
the  digitization  of  a straight  line?  If  we  can  find  a yes  answer  to  it,  can 
we  derive  an  efficient  necessary  and  sufficient  testing  algorithm  for  a digital 
arc7  Rosenfeld  answers  the  first  question  in  terms  of  the  chord  property? 

Let  a,  b be  points  of  a digital  arc  D,  and  let  aF  denote  the  real  line 
segment  between  a and  b.  We  say  that  ab  lies  near  D If  for  any  real  point 
(x,y)  on  ab,  we  can  find  a grid  point  (l,J)  on  D such  that  ma\[||-x|  , |j-y|]  <1. 
We  say  that  D has  the  chord  property  if  and  only  if  for  every  pair  of  a,  b In 
D,  the  chord  ab  lies  near  S.  The  following  two  statements  can  thus  be  proved. 

(1)  The  digitization  of  a straight  line  segment  is  a digital  arc  and  has  the 
chord  property;  (ll)  If  a digital  arc  has  the  chord  property,  we  can  find  a 
straight  line  segment  such  that  the  digital  arc  is  the  digitization  of  that 
straight  line.  It  is  noticed  in  the  sequal  that  the  digitization  of  any  curve 
lying  on  a certain  straight  strip  will  generate  the  same  digital  arc,  accord- 
ing to  our  grid-intersection  scheme. 

Section  II 

In  this  section  we  put  our  attention  on  a subset  of  a straight  line  whose 
slopes  are  rational  numbers.  A tional  number  Is  a real  number  which  can  be 
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characterized  by  two  integers  p,q  (no  common  factor  between  p and  q).  We  are 
going  to  see  a sort  of  periodic  properties  in  this  subset  of  straight  lines. 

We  also  are  going  to  examine  the  effect  of  Y intersection. 

A Freeman  chain  code  F is  said  to  be  periodic  if  we  can  find  a subchain 
T of  F such  that  F = T*R,  I = 1,2,...,R  is  a subchain  of  T.  In  order  to  have 
this  definition  actually  significant,  we  assume  |F|  » |T|  where  |F|  Is  the 
length  of  F and  |T|  Is  the  length  of  T.  In  the  following  discussion,  we  assume 
the  length  of  Freeman  Code  is  arbitrarily  long.  We  also  say  that  two  Freeman 
chains  are  cyclically  equivalent  If  we  can  obtain  one  from  the  other  by  a cer- 
tain  number  of  cyclic  shifts.  Each  cyclic  shift  is  achieved  by  removing  the 
leftmost  code  from  the  chain  and  to  place  it  next  to  the  rightmost  code  on  the 
chain.  The  number  of  shifts  needed  between  two  cyclically  equivalent  chains  is 
called  the  displacement  between  the  two  chains.  Obviously  the  displacement 
is  always  smaller  than  |t|. 

A real  straight  line  with  rational  slope  is  characterized  by  the  equation: 
Y = p/q  X + Yfa 

p,q  are  Integers  with  no  common  factors  and  q is  not  zero.  The  Freeman  chain 
associated  with  the  digitization  has  the  following  properties. 

I)  The  Freeman  chain  is  periodic  with  period  T.  So  in  order  to  Investi- 
gate the  relation  between  the  slope  and  the  Freeman  chain,  we  need  only  to 
observe  one  period.  Notice  that  we  need  only  to  consider  lines  between  0 an 
45°  without  loss  of  generality.  Referring  to  (5)  in  section  I,  "spaced  as 
uniformly  as  possible",  Brons  [3]  proposed  a structural  algorithm  to  construct 
a period  from  p and  q.  The  algorithm  is 

(1)  Read  p and  q under  the  assumption  0 < p < q;  p,q  are  integers. 

(2)  Reduce  p|q  if  possible,  call  the  new  value  pQ  and  qQ. 
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(3)  Calculate  r * q - p . (There  are  r symbols  0 and  p symbols  1 
o o o o o 

In  a period.) 

00  Let  Cj  ■ Hax(Pj,rj);  call  symbols  occurlng  Cj  times  Cj. 

(5)  Let  tj  » Hln(Pj,rj);  call  symbols  occurlng  tj  times  Dj. 

C | 

(6)  If  tj  ■ 1,  generate  the  period  Cj  Dj  and  stop  the  program. 

(7)  Calculate  Oj  - [cj/tj]. 

(8)  Calculate  Pj+j  = Cj  - t^. 

(9)  Calculate  rJ+J  “ tj  - P|+1> 

nl 

(10)  Generate  rj+j  subperiods  Cj  Dj,  call  them  Bj+j. 

^1+1 

(11)  Generate  Pl+1  subperiods  Cj  Dj,  call  them  Aj+j. 

(12)  Go  to  step  !». 

2)  There  Is  no  displacement  between  the  following  two  lines 

Lj : y *=  (p/q)X  + Yb 

L2:  y - (p/q)X  + (Yfa  + N)  , N = ...  -2,-1, 0,1 ,... 

3)  We  now  look  Into  the  displacement  between  two  lines  with  the  sane 
slope  but  differing  In  y Intersection  by  the  amount  of  1/q.  These  two  lines 
are  characterized  by 

L,  : y - p/q  * Yb 
1 2 : Y “ p/q  X + (Yb  + 1/q). 

The  amount  of  shift  of  the  corresponding  Freeman  chains  must  meet  the  require- 
ment 


Ay  = p/q  Tj  + 1/q  - n 


Ik 


where  is  an  integer,  n is  an  integer 


n is  the  smallest  one.  For  example,  the  Freeman  code  for  p/q  - 7/36,  yfa  - 
0.3333  is 


1 00000 1 0000 1 0000 100001 0000 1 0000 1 


The  difference  is  5 which  can  also  be  obtained  by  plugging  in  (1)  with  n=l, 
q = 36,  p = 7. 

i»)  It  is  noticed  that  without  loss  of  generality,  only  straight  lines 
with  y intersections  in  an  interval  of  length  one  are  needed  to  be  considered. 
For  convenience,  we  pick  up  the  Interval  Recall  that  the  Freeman  chain 

of  a straight  line  with  rational  slope  p/q  has  period  q.  So  there  are  at  most 
q different  chains  corresponding  to  that  slope,  it  is  found  that  each  of  the 
following  Intervals  determines  a chain 


4 . r4> . <K . K> <K  • • 


2 q 


(3.1  3} 
'•  » l2  q * 2 


The  Freeman  code  of  a straight  line  with  y Intersection  in  the  interval 

(1  1+i)  is  the  one  got  by  the  structural  algorithm.  The  one  in 

v2  ’ 2 q' 

(1+1 , 1+-)  is  obtained  by  shifting  r.. 

2 q 2 q 11  1 

All  the  lines  with  intersection  In  the  strip  defined  by  two  parallel 

lines  y = p/q  X + a and  y = p/q  X + a + 1/q  should  have  the  same  digitization. 
More  than  that,  all  curves  lying  in  that  strip  should  have  the  same  digitiza- 
tion too.  Thus  there  are  many  curves  corresponding  to  one  digital  curve. 

Section  1 1 1 

In  Section  II.  we  have  outlined  a few  properties  of  the  digitization  of  a 
subset  of  all  straight  lines.  That  particular  subset  Is  the  set  of  all 
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straight  lines  with  rational  slope.  In  this  section,  we  are  going  to  look  Into 
real-slope  lines.  Our  goal  Is  to  find  an  algorithm  such  that  for  any  real 
slope  we  can  find  a rational  slope  such  that  the  digitization  of  these  two 
Straight  lines  are  identical  for  the  first  certain  number  of  left-most  codes. 
Assume  the  straight  line  with  a real  slope  Is 


L1  : y - aX  + Yfa  , aeR,  o < a < 1,  1/2 

t 

the  one  with  rational  slope  Is 


b » “ & w ^ u v i f i/z  < Y.  < — , 

d 2 


L,  = y-ixeJUi 


N 2 ’ 
x 4 


N Is  an  integer  such  that 


0 < Y.  - - - — Y - I - N+1  s n P 

b 2 Nv  * Yb  2 ~ > 0 ' fT  < a : 


From  Fig.  1,  we  can  see 
that 

A - (<*Nx+yb) 


/_£_  w , 1 . N, 

(N~  Nx  + 2 + — ) 
X X 


aN  - P + (y  - iL_I) 

X vyb  Nx  2 


Figure  1 Rational  approximation  of  a real  slope 

Obviously,  we  have  Y.-JL-lcAc-L 

b Nx  2 " • 


and  aNx  >_  p,  p is  an  Integer, 


So,  we  got  p > aN  - ~+  (y  -iL-I) 
- x N v,b  N 2 
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An  algorithm  to  find  the  rational  number  Is  hereby  constructed. 

1.  Read  a • Yfa,  Ng  (a  constant  for  Increment), 

2.  Set  Nx  = Nq  (Nq  Is  an  Initial  value). 

3.  Find  N such  that  0 < Yfa  - 1 - N/Nx  and  Yb  - I - M > 0. 

4.  Set  N * N ■ N , 

x x s 

5.  Set  PP  - (oNx  + Yb)  - + X) . 

6.  Set  P = [PP]  +1. 

7.  Is  P <_  aN^?  If  It  Is  yes,  then  stop,  otherwise  N « N + N , go  to  2. 

x x s 

Usually,  Ng  is  set  to  be  1.  This  algorithm  will  give  us: 
a ■ 0.15315606  , P-27,  Q-  111,  and  P/Q-  0. 15315315. 

a - 0.23648940  , P-35,  Q = 148,, and  P/Q-  0.23648609. 

a = 0.81982273  , P-91,  Q = 111,  and  P/Q- O.8I98I982. 

Section  IV 

From  Section  I to  Section  1 1 1 we  have  investigated  various  inherent  prop- 
erties of  a straight  line,  in  this  section,  we  will  try  to  combine  all  the 
knowledge  we  have  and  to  see  if  we  can  use  it  to  recognize  a digital  arc,  i.e., 
to  determine  whether  there  is  a straight  real  line  corresponding  to  that 
digital  arc. 

One  immediate  application  of  the  properties  to  do  recognition  is  that  if 
we  know  already  that  a string  of  chain  codes  Is  a period,  then  we  can  proceed 
as  following: 

1.  The  length  of  that  string,  let  it  be  q. 

2.  The  number  of  the  elements  which  occur  singly,  let  it  be  P. 

3.  Using  the  structural  algorithm  to  generate  a period  of  chain  code, 

which  is  called  CC. 
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*•  C0"par,n9  the  '"P“‘  “ring  with  the  new  chein  CC  to  see  If  they  are 
cyclical  equivalent. 

5.  If  they  are  cyclical  equivalent,  then  the  string  is  the  digitization 
Of  a straight  line. 

The  foregoing  algorithm  Is  very  limited.  But  we  can  extend  the  concept 
to  a more  general  case.  Suppose  we  have  a string  of  Freeman  code  a,  hand.  * 
want  to  decide  if  It  Is  the  digitization  of  a straight  line.  The  new  one 
would  turn  out  rather  lengthy  In  computing  time.  In  order  to  alleviate  the 

tragic  time  consumption,  some  simple  a priori  test  may  be  performed.  Examples 
of  the  a priori  test  are: 

0)  Are  there  only  two  elements  one  of  which  occur  Slngly7 

(2)  there  only  two  run  lengths  which  are  consecutive  integers?  And 
one  of  them  occur  singly? 

(3)  Are  the  run  lengths  of  the  runs  that  occur  In  runs  two  consecutive 
integers? 

'f  a digitization  obtained  all  "yes"  in  the  a priori  test,  then  we  can 
find  the  ratio  of  the  number  of  the,  element  ocurrlng  singly  to  the  length  of 
the  Input  string.  This  ratio,  say,  is  p/q.  We  can  thur  know  that  If  it  is  a 
digitization  of  a straight  line,  the  slope  must  be  In  the  Interval  enclosed  by 
(p+l )/q  and  (p-l)/q.  So  the  rational  equivalent  line  must  have  a slope  between 
(p+l J/q  and  <p-l)/q.  Unfortunately,  there  are  infinite  but  countable  number  of 
rational  numbers  lying  between  (p+l)/q  and  <p-l)/q.  |f  We  got  to  test  ail  of 
them.  It  would  be  Impractical.  How  to  reduce  the  number  of  testing  to  be 
finite  is  still  unsolved.  But  a simple  heurlsAc  solution  might  be  applied  by 
setting  an  upper  limit  of  the  number  of  testing.  If  we  go  beyond  the  limit, 
we  would  claim  heurlstlcaiiy  that  we  can  never  find  a rational  slope  to  give 
an  equivalent  digitization  to  the  real  slope  line.  From  the  argument  in 
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section  til,  this  is  a contradiction.  Hence  we  assert  that  it  is  not  a 
digitization  of  a straight  line.  The  Freeman  code  of  the  curve, 

y •=  7/36  X + 0.50926  + X2 

which  only  slightly  deviates  from  a straight  line  y = 7/36  X + 0.50926,  when 
X is  not  very  big,  is 

0000100001 00001100001 0010100001 1 

We  can  see  that  it  will  be  rejected  in  the  a priori  testing  stage.  But  if  we 
knew  only  the  first  15  elements,  i.e.,  000010000100001,  we  would  have 
p/q  - 3/15.  So  the  two  limits  are  VI 5 and  2/14.  Obviously,  2/15  < 7/36  < 
4/15.  If  we  could  test  enough  rational  numbers  between  2/15  and  4/15  we  would 
find  7/36  and  we  would  assert  that  it  is  a digitization  of  a straight  line, 
and  indeed  it  is. 
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IMAGE  SEGMENTATION  BY  UNSUPERVISED  CLUSTERING 
M.  Y.  Yoo  and  T.  S.  Huang 

I . Introduction 

We  have  shown  results  of  image  segmentation  based  on  the  feature  pair: 
local  mean,  local  standard  deviation;  and  various  clustering  algorithms  (sim- 
ple merging  by  Euclidean  distance,  graph  theoretic  clustering)  in  the  last 
three  reports.  Our  experience  has  confirmed  that  it  Is  more  Important  to  find 
better  features  which  give  significantly  separated  clusters  than  to  develope 
an  efficient  clustering  algorithm.  We  now  show  results  of  Image  segmentation 
based  on  other  feature  pairs. 

I I . Experimental  Results 

We  will  describe  several  feature  pairs  and  show  sone  results  based  on 

them. 

(1)  Local  mean  and  standard  deviation: 


(x-m,n-y) 


(x-m,y+n) 


X X---X 


n 

+ 


X«—  m >X  X 

' (**y)  i 

X X X 


(x+m,y-n) 


(x+m, y+n) 


Figure  I Local  Window  Used  for 
Local  Operations 


Let  P(x,y)  be 
standard  deviation 


the  picture  array  then  the  local  mean  M(x,y)  and  the  local 
S(x,y)  at  (x,y)  based  on  the  local  windows  of  size  mxn  are 


. x+m  y+n 

M*X,y^  “ (2m+l)  (2n+l)  Z 1 

y=x-m  a=y-n 


x+m 


E 

y=x-m 


y+n 

E {P(y,a)  - M(x,y)} 
a=y-n 


S(x,y) 


TTm+l'Hin+l ) 


1/2 
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The  size  of  the  window  mxn  may  be  chosen  appropriately  for  the  images  to 
be  considered.  rm>na1  has  been  used. 

(2)  Local  median  and  modified  deviation: 

Let  D ■ {(a, 6):  a - x-m x+m;  8 - y+n} 

then  the  local  median  and  the  modified  deviation  are 

MED  (x,y)  ■ median  P(a, 8) 

(a, 8)  e 0 

DEV  (x,y)  * median  | P(a,8)  “ MED  (x,y)  | 

(a, 8)  e 0 

The  feature  plane  based  on  this  measure  is  given  in  Fig.  3 (computer  print 
out)  and  Fig.  4 (Gould  print  out).  The  Fig.  5 and  Fig.  6 show  the  filtered 
feature  and  the  corresponding  image,  respectively. 

(3)  Local  minimum  and  local  maximum: 

M 3 N (x,y)  * minimum  P(a,8) 

(a,  8)  e 0 

MAX  (x,y)  * maximum  P(a,8) 

(a,  8)  e 0 

Figures  7,  8,  and  9 show  feature  plane  and  filtered  images  based  on  local 
minimum  and  local  maximum. 

(4)  The  number  of  "jumps"  and  the  average  magnitude  of  the  jump: 

We  compare  two  adjacent  points  (in  all  directions)  in  the  local  window 
and  if  the  gray  level  change  is  greater  than  the  preassigned  threshold  value 
we  assumed  there  is  a jump. 

We  take  the  total  number  of  jumps  in  the  local  area  as  a feature  1 and 
the  average  magnitude  of  a jump  as  a feature  2. 

feature  1 ■ total  number  of  jumps  in  the  local 
image  of  size  mxn 
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feature  2 ■ the  average  magnitude  of  a jump 
in  the  area 

Figures  10  and  li  show  the  distribution  of  features  for  two  different 
threshold  values  and  the  filtered  features  and  the  corresponding  images  are 
shown  in  Figures  12  and  13* 

This  model  can  be  modified  a little  bit  in  counting  the  total  number  of 
local  jump. 

In  feature  i we  counted  one  jump  whenever  the  change  in  gray  levels  of 
adjacent  points  exceeds  the  threshold  value,  but  in  the  modified  version  two 
successive  jumps  in  the  same  direction  (up  or  down)  are  counted  as  one  jump. 

Figure  14  is  the  feature  based  on  this  modified  version  and  Figure  15 
and  i6  show  the  filtered  feature  and  image  correspondingly.  In  ai i cases  we 
used  rrPMi-l . 


* 

„ . ffjLJJ.1  ji'im. hi " . — ZZmIZSZImS  SSESS5555  i --* 
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Copy  available  to  DDC  does  not 
permit  iully  legible  reproduction 


Figure  2 Original  Image 
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Filtered  Picture  (Note  Fig.  k) 
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1.0  CAL  MAXIMUM 


Figure  7 Feature  Plane,  Local  Minimum,  Local  Maximum  Used 
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Figure  8 Filtered  Feature  (Note  Fig.  7) 
Minimum,  Local  Maximum  Used 
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Copy  available  to  DDC  does  not 
permit  fully  legible  reproduction 


Figure  9 Filtered  Picture  (Note  Fig.  8) 
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Figure  11  Feature  Based  on  Local  Jump 
Threshold  ■ 40  Used 
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Figure  12  Filtered  Feature  (Note  Fig.  10) 
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Figure  13  Filtered  Picture  (Note  Fig.  12) 
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Figure  N*  Modified  Version  of  Local  Jump 
Model  Threshold  ■ 60  Used 


Figure  l‘»  Filtered  Feature  (Note  Fig.  14) 
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Figure  16  Filtered  Picture  (Note  Fig.  15) 
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TEXTURE  EDGE  DETECTION  AND  CLASSIFICATION 
USING  MAX-MIN  DESCRIPTORS 

0.  R.  Mitchell 

I . Introduction 

The  detection  of  the  boundary  between  two  regions  of  differing  textures 
is  an  lntrigu  ng  problem.  The  ability  to  do  this  consistently  would  be  a 
valuable  tool  for  the  broader  task  of  image  segmentation  which  is  generally  an 
early  step  In  the  automatic  extraction  of  Information  from  images. 

Image  texture  Is  an  111“deflned  term  which  Implies  local  patterned  vari- 
ations over  a region.  As  an  example  of  a texture  boundary.  Fig.  1 shows  two 
texture  regions,  each  takin  from  a book  by  Bro’atz  [1].  The  texture  on  the 
left  Is  cork  and  that  on  the  right  is  wood.  Also  two  strips  in  the  picture 
have  been  multiplied  by  0.5  to  create  artificial  non-texture  boundaries  such 
as  might  be  caused  by  shadows. 

A standard  method  for  boundary  detection  is  to  perform  local  averages 
ever  two  adjacent  windows  and  compare  them,  expecting  a large  difference  if  a 
boundary  lies  between  the  two  windows.  However,  *or  this  metf >d  to  be  appli- 
cable to  texture  boundaries  some  transformation  must  oe  fi  formed  to 

change  local  texture  properties  to  grey  levels  as  «><  .»  ested  by  Rosenfeld  et. 
al . [2],  [3].  This  Is  essentially  the  method  of  te.  ture  edge  detection  to  be 
described  In  this  paper.  The  transformation  uses  a new  process  called 
"Max-Mi n Descriptors"  [4], 

I | . The  Max-Min  Transformation 

The  new  measure  for  texture  analysis  Is  based  on  the  Intuition  that  the 
important  texture  Information  for  the  human  visual  system  Is  contained  in  the 
relative  frequency  of  local  extremes  in  intensity.  The  principal  measurement 
In  this  process  Is  the  determination  of  the  local  grey  level  ftwixima  and 
minima  and  their  sizes  along  a one-dimensional  scan  direction.  The  grey  level 
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values  are  first  sent  through  a smoothing  process  which  eliminates  reversals 
of  small  amplitude,  thereby  retaining  only  the  principal  extrema.  The 
smoothing  algorithm  is  the  digital  equivalent  of  the  familiar  analog  mechani- 
cal process  known  as  gear  backlash  and  It  was  originally  described  as  a pre- 
processing method  for  character  recognition  [5]. 

The  smoothing  algorithm  is  described  as  follows:  let  x^  be  the  grey 

f h 

level  of  the  k——  point  along  the  scan  lire  and  let  y^  be  the  "smoothed"  value. 
Let  T be  the  value  of  a preassigned  threshold  parameter.  Let  us  start  with 
Yj  = Xj  and  proceed  according  to  the  algorithm  shown  below: 


IF 


T 

xk+l  " 2 


xk+l 

Vl 


k+1 


THEN 

Vl  = Vl  " \ 
Vl  " yk 
Vi  = xk+i  + 7 


Fig.  2 Illustrates  the  way  In  which  the  smoothing  process  eliminates 
reversals  of  small  amplitude,  thereby  retaining  only  the  principle  extrema. 

By  repeating  this  process  for  several  threshold  settings  a group  of  extrema 
can  be  obtained  to  characterize  the  texture.  For  the  purpose  of  edge  detec- 
tion each  extrema  Is  labeled  according  to  the  maximum  threshold  at  which  It 
still  exists,  as  shown  in  Fig  3 

Actually  grey  level  dens i ties  (log  intensities)  are  used  In  the  extrema 
detection  process.  This  changes  multiplicative  effects  (such  as  lighting 
level  or  film  processing)  Into  additive  changes  so  that  very  few  changes  occur 
in  extrema  locations  and  sizes.  The  logarithmic  transformation  Is  also 
similar  to  the  human  visual  system's  brightness  sensitivity. 
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Ml.  Edge  Detection  Results 

The  extrema  used  for  edge  detection  were  generated  by  first  detecting 
local  extrema  by  scanning  In  the  horizontal  direction  and  then  rescanning  in 
the  vertical  direction.  The  extrema  Detected  in  this  manner  from  the  image  in 
Fig.  1 are  shown  in  Fig.  4.  Brightr*  indicates  extrema  size  In  this  pic- 
ture. This  extrema  image  Is  the  input,  to  the  edge  detection  algorithm.  At 
each  point  two  windows  are  placed  as  shown  in  Fig.  5.  Then  an  edge  value  at 
that  point  is  calculated: 


E V = E 
i 


1 A 1 B ' 1 

YZ 7+TTTk 

A 1 B 1 


where  Ee.  is  th°.  total  number  of  extrema  of  size  i in  window  A and  K is  a 
A 1 

constant  introduced  to  reduce  the  effect  of  isolated  extrema. 

Figure  6 shows  the  edge  values  using  the  extrema  in  Fig.  4,  a window  size 
of  64x  48,  and  10  extrema  levels.  The  edge  value  picture  is  then  input  to  the 
extrema  detection  algorithm  again  with  only  local  maxima  above  a fixed  thres- 
hold being  kept.  This  result  Is  shown  in  Fig.  7 and  is  the  resulting  detected 
edge.  This  edge  is  shown  superimposed  on  the  original  in  Plate  1. 

A more  complex  texture  mosaic  is  shown  in  Fig.  8.  The  upper  left  section 
is  cork,  upper  center  is  wood,  lower  left  is  pebbles,  and  right  is  magnified 
paper.  The  extrema  are  shown  in  Fig.  9*  edge  values  using  a 64x  48  window  in 
Fig.  10,  and  detected  edges  in  Fig.  II.  In  Plate  2 the  detected  edges  are 
shown  superimposed  on  the  original  textures. 

Another  texture  mosaic  is  shown  in  Fig.  12.  It  consists  of  64  texture 
regions  each  64x  64,  Plate  3 shows  the  detected  boundaries  superimposed  on 
the  original.  The  window  size  used  was  32x  32. 
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IV.  Other  Appl Ications 

Shown  In  Fig.  Is  a 256  x 256  x-ray  Image.  The  detected  maximum 
extrema  are  shown  In  Fig.  15.  Due  to  the  Initial  logarithmic  transformation, 
these  extrema  should  be  little  affected  by  attenuation  factors  In  the  x-ray. 
Level  slicing  can  be  applied  to  the  extrema  picture  to  show  selected  aspects. 
For  example,  Fig.  16  shows  only  the  lowest  two  levels  of  extrema.  This  might 
be  useful  In  detecting  a lung  disease  which  shows  as  a texture  change.  Figure 
17  shows  edge  values  using  a 16  x 16  window  and  two  extrema  levels.  Plate  k 
shows  the  detected  edges  superimposed  on  the  original  x-ray. 

Shown  In  Fig.  19  Is  a 256  x 256  aerial  photograph.  The  detected  maxima 
extrema  are  shown  In  Fig.  20  ard  the  detected  edges  using  a 16  x 16  window 
and  two  extrema  levels  are  shown  In  Plate  5 superimposed  on  the  original.  A 
real  potential  exists  In  segmenting  complex  pictures  such  as  this  by  using  a 
set  of  different  window  sizes  and  a hierarchical  structure  of  segmented  edges. 
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Figure  1 

Two  texture  regions  (cork 
and  wood)  with  artificial 
boundaries  created  by 
attenuating  certain  areas 
of  the  picture. 


Figure  4 

Extrema  detected  in  Fig.  1. 
Brightness  indicates  extrema 
size.  Changes  in  illumination 
cause  some  extrema  changes  due 
to  saturation  of  logarithmic 
curve  appl led. 


Figure  5 

Window  placement  for  each 
point  in  the  picture  for 
edge  value  determination. 
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Figure  6 


Edge  values  from  Fig.,  3 
using  algorithm  described 
in  the  text  and  a window 
size  of  6*4  x *48. 


Figure  7 

Texture  edge  detected  in 
Fig.  1 


Figure  8 

Another  texture  mosaic*  The 
upoer  left  section  is  cork, 
upper  center  is  wood,  lower 
left  is  pebbles,  and  right  is 
magnified  paper* 
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Figure  9 

Extrema  detected  in  Fig,  8 


Figure  10 

Edge  values  for  Fig*  8 using 
a 6*4  x *48  wi  ndow. 


Figure  11 

Texture  edges  detected  in 
Fi g„  8. 
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Figure  I?. 


A more  complex  texture  mosaic 
It  consists  of  texture 
regions  each  6A  x 


Figure  13 

Texture  edges  detected  in 
Fig.  12. 


Figure  1 A 


A 256x  256  x-ray  image. 
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F i (jure  15 

Maximum  extrema  detected  in 
Fig.  14 


Figure  16 

Lowest  two  levels  of  maximum 
extrema  detected  in  Fig.  14. 


Figure  17 

Edge  values  for  Fig.  14  using 
a 16  x 1 6 wi ndow. 


i*. 
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Figure  19 

A 256 x 256  aerial  Image. 
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Plate  1 

Superposition  of  texture 
regions  (Fig.  1)  and  detected 
edge  (Fig.  7). 


Plate  2 

Superposition  of  texture 
regions  (Fig.  8)  and  detected 
edges  (Fig.  11). 
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Plate  3 


Superposition  of  texture 
regions  (Fig,  12)  and  detected 
edges  (Fig.  13). 


Plate  k 

Superposition  of  x-ray  (Fig. 
1*0  and  detected  texture  edges 
(Fig.  18). 


Plate  5 

Superposition  of  aerial  scene 
(Pig*  19)  and  detected  texture 
edges  (Fig.  20). 
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FOURIER  SHAPE  DESCRIPTORS 
T.  Wallace  and  P.  A.  V/lntz 

The  Fourier  descriptor  (FD)  Is  one  method  of  describing  the  shape  of  a 
planar  figure.  Given  a figure  In  the  complex  plane,  the  contour  can  be  traced, 
yielding  a complex  function  of  time.  If  the  contour  Is  traced  repeatedly, 
the  periodic  function  which  results  can  be  expressed  In  a Fourier  series. 
Granlund  [1]  defines  the  FD  of  a contour  as  the  coefficients  of  this  Fourier 
series. 

To  Implement  this  method  of  shape  description,  It  is  necessary  to  sample 
the  contour  at  a finite  number  of  points.  Since  the  discrete  Fourier  trans- 
form of  a sequence  gives  us  the  va.ues  of  the  Fourier  series  coefficients  of 
the  sequence,  assuming  It  to  be  periodic,  using  an  FFT  algorithm  satisfies 
the  definition  above.  The  computational  advantages  of  the  FFT  are  well  known. 

The  goal  of  this  work  Is  to  classify  the  shapes  of  objects  using  their 
Fourier  descriptors.  The  operations  of  rotation,  scaling,  and  moving  the 
starting  point  are  easily  Implemented  In  the  frequency  domain  by  simple 
arithmetic  on  the  frequency  domain  coefficients.  While  shapes  may  be  compared 
In  the  space  domain,  the  procedures  required  to  adjust  their  size  and  orienta- 
tion are  computationally  very  expensive.  Normally  an  Iterative  type  of  algo- 
rithm Is  employed,  which  searches  for  an  optimum  match  between  the  unknown 
shape  and  the  test  set. 

The  goal  of  classification  using  Fourier  descriptors  Is  to  develop  an 
algorithm  which  will  normalize  the  size  and  orientation  of  a shape  before  any 
comparisons  to  test  shapes  are  made.  If  this  can  be  accomplished,  the  clas- 
sification process  becomes  a simple  clustering  problem  with  no  Iterative 


searches  to  contend  with. 
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Let  the  output  vector  of  the  FFT  be  denoted  (A (0) ,A(1 ) , . . . ,A(N) ) . The 
normalization  procedure  is  as  follows. 

For  position  normalization  simply  set  A(0)  equal  to  some  constant,  or 
disregard  it  entirely,  since  it  contains  no  information  other  than  the  posi- 
tion of  the  contour  relative  to  the  origin  of  the  complex  plane. 

For  magnitude  normalization,  multiply  all  the  coefficients  by  a factor 
chosen  such  that  the  magnitude  of  some  reference  coefficient  is  a specified 
value.  For  most  contours  traced  in  the  counterclockwise  direction,  A(l)  will 
be  the  largest  coefficient,  so  the  obvious  procedure  is  to  set  MAG  (A ( 1 ) ) 
equal  to  unity. 

The  other  shape-preserving  operations  which  must  be  performed  in  the 
normalization  procedure  affect  orientation  and  starting  point.  To  rotate  the 
contour,  all  coefficients  may  be  multiplied  by  EXP  ( J6) . Where  6 is  the  angle 
of  rotation.  To  move  the  starting  point  from  which  the  contour  is  traced, 
multiply  the  Ith  coefficient  by  EXP  (JIT),  where  T is  the  distance  along  the 
contour  which  the  starting  point  is  to  be  moved  (the  length  of  the  contour  is 
defined  to  be  2PI). 

If  the  unknown  shape  is  an  exact  match  to  a test  shape,  normalizing  the 
orientation  and  starting  point  is  straightforward.  Since  there  are  two  per- 
missible operation  affecting  the  phase  of  the  coefficients,  the  reference 
orientation  and  starting  point  can  be  defined  by  specifying  the  phase  of  two 
selected  coefficients,  such  as  A(l)  and  A(2). 

When  the  unknown  shape  is  not  a match  to  any  of  the  test  shapes,  and  we 
would  like  to  determine  which  of  the  test  shapes  it  is  closest  to,  the  problem 
becomes  more  difficult.  The  same  procedure  can  be  followed,  but  if  the  dif- 
ference between  two  similar  shapes  lies  mainly  in  the  particular  coefficients 
used  in  normalization,  an  optimum  orientation  and  starting  point  may  not  be 


■■  ■ -- 
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achieved.  A simple  application  of  Parseval's  theorem  shows  that  the  square 
of  the  Euclidean  distance  in  the  frequency  domain  corresponds  to  a point  by 
point  mean  square  error  in  the  space  domain.  It  is  evident  that  a small  error 
in  rotation  or  starting  point  normalization  may  cause  the  whole  normalization 
procedure  to  break  down. 

It  can  be  easily  shown  that  in  the  case  of  contours  with  bilateral 
symmetry,  if  the  starting  point  is  chosen  to  lie  on  the  axis  of  symmetry,  the 
phases  of  all  the  coefficients  will  be  equal  to  zero,  or  180  degrees.  This 
indicates  that  there  is  less  information  contained  in  the  phases  of  the  FD 
than  in  the  magnitudes,  so  a classification  using  the  magnitudes  alone  should 
be  effective.  If  the  contours  to  be  classified  are  perfectly  bilaterally 
symmetric,  there  is  no  problem  in  normalizing  the  FDs,  since  the  procedure 
above  will  work  perfectly.  However,  in  practice,  this  is  highly  unlikely,  due 
to  noise  and  quantization  error. 

A set  of  aircraft  silhouettes  was  used  to  test  the  magnitude  classifica- 
tion method  for  bilaterally  symmetric  contours.  Eighteen  planes  were  digi- 
tized, the  contours  traced,  and  the  FDs  computed.  Figures  1 thorugh  18  show 
the  contours  used  in  the  experiment.  Figure  19  shows  the  Euclidean  distances 
In  the  frequency  domain  between  the  magnitudes  of  the  FDs.  Planes  1 through 
9 are  rear  engine  types,  planes  10  through  12  are  two  engine  types,  and 
planes  13  through  18  are  four  engine  types.  The  original  contours  contained 
128  points,  but  only  the  first  48  frequency  domain  components  were  used  In  the 
classification;  the  higher  coefficients  contain  information  about  quantization 
and  very  small  detail,  and  have  little  effect  on  the  distances. 

If  the  plane  being  classified  Is  deleted  from  the  test  set,  and  Its  type 
is  assumed  to  be  that  of  the  nearest  plane  in  the  set,  only  one  error  occurs. 
Plane  11  is  slightly  nearer  to  Plane  15  than  to  Plane  12.  It  should  be  noted 
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however,  that  there  are  only  three  planes  In  the  two  engine  category,  so  we 
probably  do  not  have  a very  complete  test  set.  It  should  also  be  noted  that 
classifying  purely  on  engine  location  is  not  synonymous  with  matching  similar 
shapes;  factors  such  as  wing  sweep  and  length  to  wingspan  ratio  are  neglected. 
The  influence  of  some  of  these  latter  factors  is  evident  from  an  examination 
of  the  original  contours.  Plane  1,  for  example,  while  not  mi sclass if  led,  has 
a much  smaller  length  to  wingspan  ratio  than  the  other  rear  engine  planes. 

This  ratio  is  near  unity,  which  is  similar  to  most  of  the  four  engine  planes. 
This  explains  why  Plane  1 is  not  very  close  to  any  of  the  other  planes,  com- 
paratively speaking.  It  also  explains  why  Plane  1 is  closer  to  the  four 
engine  group  than  is  any  other  two  engine  model. 
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Figure  2 
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Figure  7 
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Figure  8 
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Figure  9 
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Figure  10 
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Figure  12 
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figure  13 
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Figure  14 
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Figure  15 
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Figure  16 
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Figure  17 
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Figure  18 
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SYNTACTIC  SCENE  ANALYSIS 
K.S.  Fu,  P.H.  Swain,  R.Y.  Li,  J.  Keng,  T.S.  Yu 

The  bulk  of  the  pattern  recognition  techniques  employed  to  date  for  the 
machine  analysis  of  aircraft  or  satellite  gathered  Image  data  have  concentrated 
on  the  spectral  characteristics  of  the  data,  i.e.,  "color"  in  a generalized 
sense.  Relatively  little  use  has  been  made  for  Image  understanding  purposes 
of  the  information  contained  In  the  spatial  relationships  in  the  data.  The 
goal  of  this  research  is  to  characterize  the  spatial  information  content  of 
such  imagery  in  terms  of  (I)  syntactic  image  models  [1,2, 3, 4]  and  (2)  statls- 
tical  dependency  models  of  context  [5,6] • As  a subtask,  we  shal 1 also  study 
change  detection,  a particularly  significant  practical  aspect  of  spatial  scene 
information. 

(i)  Syntactic  image  modeling. 

The  immediate  thrust  of  this  activity  is  to  extend  the  work  of  Fu  and 
Brayer  [4],  who  developed  grammars  for  describing  Image  entities  such  as  roads 
and  cloud/shadow  pairs.  Linguistic  rules  in  terms  of  tre^  or  web  grammars  can 
be  constructed  which  characterize  such  properties  as  shape,  size,  and  texture 
as  well  as  spatial  relationships  such  as  "surrounded  by"  or  "near  by".  Di- 
rectional references  can  also  be  explored  to  locate  classes  of  large  areas 
where  no  definite  shapes  exist,  such  as  those  found  in  land-use  classification. 
A hierarchical  or  tree  graph  model  can  be  built  to  contain  the  spatial  distri- 
butions of  all  classes  in  the  entire  scene.  We  are  investigating  the  extent 
to  which  syntactic  rules  in  general,  with  tree  structures  in  particular,  can 
be  used  for  generating  shape  descriptors  for  recognition  purposes  in  the  con- 
text of  remote  sensing  data  where  traditional  purely  spectral  approaches  would 
not  achieve  satisfactory  results. 

At  present,  we  have  selected  imagery  of  Lafayette,  Indiana  as  a training 
data  set.  Out  of  seven  primary  ground  cover  types,  we  have  chosen  river  and 
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highway  because  of  their  relatively  homogeneous  spectral  characters  and  their 
linear  shape  patterns.  A set  of  syntactic  rules  Is  to  be  inferred  and  sub- 
sequently applied  to  other  data  sets  to  test  its  generality. 

(ii)  Statistical  dependency  models  of  context 

The  traditional  simple  decision  procedure  is  to  select  a state  (i.e., 
classify  a point  as)  belonging  to  the  member  of  nature  space  0-  (1,2,. ..r) 
which  will  minimize  the  Bayes  risk 

Z L(6,a)p(x|0)G(e)  (D 

0-1 

where  L(0,a)  Is  the  loss  function  for  0e0  and  aeA  - 0.  The  loss  function  L is 

?ve>'aged  over  i prior  distribution  of  6.  G(6)  is  the  prior  probability  of 

occurrence  of  class  0.  To  Introduce  context  into  the  model  we  use  a compound 

decision  rule.  In  this  case  we  have  13  - (0 , as  a set  of  states  of 

*n  i n 

nature  and  a set  x - (x,,...x  ) of  vector-valued  random  variables  to  be  class! 
~n  I n 

fied  join' ’y.  We  must  choose  a set  of  actions  gn  » (aj,...an)  to  minimize  the 
expected  loss  for  a compound  loss  function  L(0n,an)  where  l(0n»j*n)  is  the 
average  of  the  losses  Incurred  in  each  of  the  n decisions. 

L(s„-sn)  -k  ,£ . HA-V  (2) 

k-1 

Lk(i,j)  is  the  loss  for  kth  decision  If  ak  - j and  0R  - 1 . The  optimal  com- 
pound decision  rule  which  minimizes  the  compound  Bayes  risk  chooses  akeA, 
which  minimizes 

I L(ek,ak)p(!in|8n)G(9|i)  (3) 

V 

where  x is  the  set  of  recognition  vectors  for  all  cells  of  interest. 

-n 

Note  that  x is  an  n-dimenslonal  vector  of  vectors.  Further,  p(x  |9,  ) is 
~n  n * 

a density  function  defined  on  this  vector  space  which  in  practice  can  not  be 
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evaluated  without  making  some  simplifying  assumptions.  We  shall  list  the 

assumptions  and  state  the  final  decision  rule  here  without  giving  detailed 
derivations. 

Consider  Figure  1. 


Fig.  1.  Labeling  of  neighbors  of  cell  k. 

Adjacent  cells  are  defined  as  two  cells  with  a common  side.  Our  first  assump- 
tion is  that  contextual  relationships  between  nonadjacent  cells  are  negligible. 
The  second  assumption  is  that  the  pattern  vector  xR  of  a cell  ek  is  a function 
only  of  9k,  the  nature  of  the  cell. 

The  use  of  four  neighbors  (kj , k 2,  k^,  k^)  as  contex  Information  has  been 
developed  by  Welch  and  Salter  [6].  The  final  decision  rule  under  the  above 
assumptions  Is  to  choose  acA  to  minimize 
r 

i L(9k,.)p<»  |e  )G(e  ) it  e p(x  |e  )p(e  |e  ) («) 

6k  ' J-l  9^  K|  k,  Kl  k 

We  made  one  more  assumption  to  Incorporate  eight  neighbors  of  cell  k as  con- 
text In  the  algorithm  for  testing  purposes.  The  additional  assumption  Is  fhr.t 
the  contribution  due  to  four  adjacent  cells  of  k Is  Independent  of  that  due  to 
four  cells,  namely,  k^,  kg,  k^,  kg.  The  decision  rule  is  then  to  choose  aeA 


to  minimi ze 


«7 


„£  LCek.a)p(xk|ek) 

Lk  

8 

p(*klek)  n z 


k 


-con 


(5) 


where  the  "adj"  term  denotes  the  contribution  due  to  the  four  adjacent  cells 
and  the  "con"  term  denotes  that  due  to  four  corner  cells. 

Both  of  these  context-dependent  decision  procedures  have  been  programmed 
and  applied  to  mul tlspectral  LANDSAT  data.  Initial  results  appear  to  verify 
that  use  of  context  can  significantly  Improve  classification  accuracy.  However, 
the  cost  in  terms  of  computation  required  Is  substantial.  I„  fact,  the 
marginal  Improvements  observed  In  going  to  8 neighbors  as  compared  to  A may  not 
be  worth  the  added  computational  load.  The  potential  Impact  of  parallel  pro- 
cessing on  this  problem  also  needs  to  be  considered.  It  may  also  be  p, ssible 

to  recast  the  equations  In  a recursive  form  which  will  reduce  the  required 
computations. 
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"iSMSE .-STSS”  « 

°*  R.  Mitchell  and  P.  l.  Chen 

I.  INTRODUCTION 

;r  (ARpAReport »»>.  w,areusIn9 

r,  c filtering  techniques  to  remove  -I tlpllcetlve  noise  effects  such 

:::  7d  * ^ — - -*■-«-  - ^ 

se  have  been  revised.  The  filtering  function  Is  of  the  form 
“ •"  «,  Is  the  p«er  spectrum  of  the  signal  and  Df  the  noise.  '2 

- 

ed  'n  the  previous  report. 

"•  COMPUTER  SIMULATION 

--  - ... 

V 5°  thi"  the  *ma9e  s(x,y)  Is  formed  by: 

s(x.y)  - I rfx.y)  t(x.y)  ♦ Kl-t(x.y)) 

0 1 r(x,y)  <_  l 
0 1 t(x,y)  <_  I 

Th's  represe"'a  «•  effect  of  cloud  transmission  [t(x  yl]  aro  „ „ 

[r(x  v)l  jinri  | , _ 1 ’y'J’  9 round  reflectance 

1 1 ,y,J  and  cloud  reflection  [|-t(x  y)l  of  »„  tri 

' ,y;j  of  Incident  sunlight  [I] 

linear  filtering  Is  performed  on 

'°9  " ' SU-V)1  ' 'OS  " * * 'og  [t(x.y„  ♦ ,og  , 

-h  — e signal  and  noise.  The  signal  estate  Is  then  obtained  by 
exponentiating  the  filter  output  and  Inverting  the  grey  levels. 
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III.  SIMULATION  RESULTS 

Results  of  the  simulation  using  64x64  pictures  are  shown.  For  comparison 
purposes,  the  mean  and  standard  deviation  of  the  noisy  and  filtered  pictures 
were  normalized  so  that  U * 128  and  0 ■ 48  on  a display  scale  of  0 to  255 • 

Fig.  1(a)  is  a noisy  signal  with  I * 15,  r(x,y)  - 2/3  In  background  and  4/5  in 
foreground,  and  t(x,y)  is  white  noise  uniformly  distributed  between  0.4  and 
0.6.  The  filtered  result  is  shown  in  Fig.  1(b).  Fig.  2(a)  is  another  noisy 
picture  with  the  noise  increased  to  range  from  0.02  to  1.0.  Fig.  2(b)  shows 
the  filtered  result.  Fig.  3(a)  is  a noisy  signal  with  I * 15,  r(x,y)  * Q in 
background  and  4/S  in  foreground,  and  the  noise  ranges  uniformly  from  0.4  to 
0.6. 

In  Fig.  4(a)  the  same  noise  as  used  in  Fig.  2(a)  was  low  pass  filtered 
before  it  was  used.  Fig.  4(b)  shows  the  resulting  filter  output.  The  signal 
edges  (high  frequencies)  are  retained  because  the  noise  has  no  components  at 
these  frequencies. 

Work  is  now  in  progress  in  applying  a three  dimensional  extension  t.hls 
filter  to  cloudy  LANDSAT  data. 
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Fig.  1(a)  Signal  with  multiplicative  and  reflective  noise.  Noise  ranges 
from  0.4  to  0.6. 
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Fig.  1(b)  Filtered  output  from  Fig.  1(a),  displayed  with  same  mean 
and  contrast  as  Fig.  1(a). 
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Fig.  2(a)  Same  picture  as  In  Fig.  1(a)  except  the  multiplicative  noise 
Is  Increased  to  0.02  to  1.0. 


Fig.  2(b)  Filtered  output  from  Fig.  2(a) 


Fig.  3(a)  Same  noFse  as  Fig.  1(a),  signal  level  Is  further  reduced 
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Fig.  3(b)  Filtered  output  from  Fig.  3(a) 
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Fig.  4(a)  Same  picture  as  In  Fig.  2(a)  except  the  noise  Is  first  Ideal 
low  pass  filtered  to  8 cycles/picture  width. 
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Fig.  J»(b)  Filtered  output  from  Fig.  3(a) 
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MISSiLE  TRACKING 
R.  Gunshor  and  T.S.  Huang 

This  project  is  motivated  by  the  real-time  vldfjo  tracking  problems  at  the 
U.S.  Army  White  Sands  Missile  Range.  The  missile  movie  frames  were  supplied 
to  us  by  Dr.  Alton  Gilbert  of  WSMR. 

Our  preliminary  experimentation  [l]  has  Indicated  that  it  Is  possible  In 
many  cases  to  detect  and  locate  a missile  by  processing  single  scan  lines. 

This  opens  up  the  possibility  of  very  fast  algorithms  which  scan  selectively 
instead  of  full  frame. 

Solid  state  Imagors,  such  as  surface  acoustical  wave  and  charge-coupled 
devices,  are  particularly  suitable  for  selective  scanning.  In  this  report,  we 
present  some  very  preliminary  results  of  using  a SAW  linear  scanner  to  scan  one 
line  of  a missile  frame.  The  purpose  of  the  experiment  is  to  test  the  capabil- 
ities of  the  SAW  scanner  under  development  at  Purdue  with  regard  to  the  missile 
tracking  project. 

A frame  from  a motion  picture  of  a missile  flight  at  WSRM  was  scanned 
using  a surface  acoustic  wave  device.  The  results  are  shown  in  Fig.  i.  Oscil- 
lograms A,  B,  and  C represent  the  gate  output  at  2u».  The  envelope  amplitudes 
are  proportional  to  the  iiiuminatlon  levels  along  the  scan  path.  A represents 
a scan  of  the  background  sky  above  the  missile;  B Is  a scan  across  the  main 
missile  body  showing  a bright  area  to  the  left,  followed  by  a dark  shadow 
region;  C is  a scan  across  the  vapor  trail  of  the  missile  showing  the  contrast 

between  the  vapor  trail  and  the  background  sky. 

The  device  employed  in  this  experiment  was  a monolithic  convolver  fabri- 
cated by  evaporation  of  CdSe  onto  a lithium  niobate  substrate.  The  convolver 
efficiency  of  this  particular  device  was  only  -75  db  (output  divided  by  prod- 
uct of  input  powers).  The  state  of  the  art  for  SAW  convolvers  is  about  -20  db. 
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The  low  ef>  ency  of  this  particular  device  meant  that  a relatively  long 
scanning  pj  (0.2  psec)  had  to  be  used  to  obtain  an  acceptable  slgnal-to- 
nolse  ratio.  For  a propagation  velocity  of  3.5  X 105  cm/s  the  spatial  extent 
of  this  pulse  Is  0.7  mm;  this  Is  thus  the  order  of  resolution  for  this  experi- 
ment. Also,  the  acoustic  beam  width  Is  2 mm;  this  results  In  some  averaging 
along  the  length  of  the  missile.  Despite  the  use  of  this  non-optimum  scanning 
device,  the  main  features  of  the  picture  are  Identifiable  from  the  oscillograms. 
The  output  signals  shown  In  the  figure  represent  the  rf  output  of  the  convolver 
gate.  These  signals  (at  180  MHz  for  this  device)  can  be  applied  directly  to  a 
SAW  correlator  for  real-time  processing  of  the  image  signals. 
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Manufacturer 
Beehive  Elect. 
Tex.  Inst. 
Digi-Data 

DEC 

DEC 

Fabritek 

Versatek 

Comtal 

Data  Printer 
True-Data 
Tektronix 
DEC 


QTY 

3 

2 
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FACILITIES 

Description 

nSuper-Bee"  Terminals 

"Silent  700"  Terminals 

Industry  standard  magnetic  tape  system; 

2,  9“track  and  1,  7“track  drives;  one  each 
NRZI  and  phase-encoded  formatters/controllers 

Dual -drive  DEC tape  unit 

RP03  disk  drive  (40  million  characters) 

96K-word  auxiliary  memory  system 
(64K  bought  by  ARPA,  32K  by  NASA) 

Electrostatic  matrix  printer 

Color  picture  display 

132  column,  600  L.P.M.  line  printer 

Punched  card  reader 

Model  4010,  graphics  display 

PDP-11/45  computer  system;  system  includes: 
32K  memory 

FPP-11  floating  point  processor  (NSF  money) 
H960  extension  mounting  cabinet 
3 - small  peripheral  mountings  blocks  (DD-ll) 

1 UNI  BUS  repeater/expander 

Dilll,  16-line  terminal  multiplexor 

KWll-p  programmed  clock 

"ANTS"  - type  PDP-1 1/IMP  interface 


Note:  Our  PDP-11/45  Is  currently  operating  under  the  UNIX  system 
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